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Abstract. The Mock Data Challenges (MLDCs) have the dual purpose of 
fostering the development of LISA data-analysis tools and capabilities and of 
demonstrating the technical readiness already achieved by the gravitational-wave 
community in distilling a rich science payoff from the LISA data. The first round 
of MLDCs has just been completed and the second-round data sets are being 
released shortly after this workshop. The second-round data sets contain radiation 
from an entire Galactic population of stellar-mass binary systems, from massive- 
black-hole binaries, and from extreme-mass-ratio inspirals. These data sets are 
designed to capture much of the complexity that is expected in the actual LISA 
data, and should provide a fairly realistic setting to test advanced data-analysis 
techniques, and in particular the global aspect of the analysis. Here we describe 
the second round of MLDCs and provide details about its implementation. 



1. Introduction 

The Laser Interferometer Space Antenna (LISA) is a spaceborne gravitational-wave 
(GW) laser interferometer for the observation of the low- frequency (« 0.1 mHz- 
1 Hz) GW sky (see [TJ [5] and Danzmann's contribution in this volume). LISA 
is an all-sky monitor with the capability of observing a variety of compact-object 
binary systems, with masses ranging from a fraction to millions of solar masses. 
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Moreover, LISA could discover GWs from entirely new classes of sources, such as exotic 
compact objects and relics from the early universe (see [TJ|3] and references therein). 
Although much relevant experience has already been gained in the analysis of GW 
data collected by ground-based detectors, the differences between space-based and 
ground-based observations present new difficulties and require novel solutions for data 
analysis. These differences include the complex LISA response, the confusion noise 
from Galactic and extra-Galactic binary populations, and the simultaneous presence 
of many weak and strong GW signals. It is important to tackle these new analysis 
problems early, in order to develop the tools and methods necessary for the maximum 
science exploitation of such a revolutionary data set. 

The LISA International Science Team (LIST) has embarked on a programme 
to foster the development and evaluate the technical readiness of data-analysis tools 
and capabilities for LISA. This programme goes under the name of Mock LISA Data 
Challenges (MLDCs). The MLDC Task Force| has been charged by the LIST to 
formulate challenge problems, develop standard models of the LISA spacecraft and 
orbits, and of GW sources, provide computing tools (e.g., LISA response simulators 
and source- waveform generators) , establish criteria for the evaluation of the responses 
to the challenges, and provide any technical support necessary to the challenge 
participants. These challenges are meant to be blind tests, but not really contests; 
the greatest scientific benefit stemming from them will come from the quantitative 
comparison of results, analysis methods and implementations. 

The first round of MLDCs [6] has just been completed. Details on the data 
sets, techniques developed for the analyses, and results are provided in the companion 
article in this volume [7j, and in the references cited there. In this short paper 
we describe the second round of MLDCs, which has just been released |S]. The 
Challenge-2 data sets represent a very significant increase in complexity with respect 
to those distributed for Challenge 1. More importantly, they contain the full set of 
key sources that are expected to make up the real LISA data, and therefore provide a 
realistic testbed for different data-analysis approaches. Since the instrument response, 
instrument noise, and waveform models adopted so far in the MLDCs employ various 
simplifications, much work will be needed beyond the completion of Challenge 2 to fully 
develop the data-analysis capabilities necessary for the mission, and future rounds of 
MLDCs will address progressively more realistic measurement scenarios and introduce 
more general gravitational waveforms. The details of the schedule of MLDCs from 
Challenge 3 onwards have not been decided yet, but we expect to release a third 
round of MLDCs in summer 2007. 

2. Mock LISA Data Challenges: Second round 

The Challenge-I data sets contained single sources, or sets of several (sa 20) signals 
well separated in parameter space; an exception were two data sets containing 50 
signals from Galactic binaries, concentrated in frequency bands of width 30 /iHz 
and 3 /xHz. Challenge I focused on only two signal classes: Galactic stellar-mass 
binaries and massive-black-hole (MBH) binary inspirals. Indeed, the key goal of this 
first round of challenges was the development and validation of source-specific data- 
analysis techniques for the sources featured in the LISA mission's minimum science 

X Full details regarding the Task Force activities, as well as links to relevant resources, are available 
on the Task Force wiki [1]. 
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requirements. Challenge 2 has a much more realistic flavour: it includes millions 
of Galactic binaries, as we expect in reality, in addition to multiple massive-black- 
hole binaries and extreme-mass-ratio inspirals (EMRIs). Thus, Challenge 2 requires 
analysts to tackle the global analysis of many simultaneous and superimposed signals, 
in the presence of the most complex sources (EMRIs) that we expect to observe with 
LISA. 

Challenge 2 includes five single-source data sets (Challenge 1.3) @ dedicated 
specifically to EMRIs, and two multi-source data sets (Challenge 2.1 and 2.2). The 
data sets are all ~ 2 years long: namely, they contain 2 22 data points sampled at a 
cadence of 15 s. More specifically, the data sets are as follows: 

• Challenge 1.3 consists of five data sets, each containing a GW signal from a single 
EMRI with signal-to-noise ratio (SNR^j] between 40 and 110; 

• Challenge 2.1 contains signals from (i) a full population of Galactic binary 
systems (about 26 million sources), including (ii) 25 "verification binaries"; 

• Challenge 2.2 contains signals from (i) a full population of Galactic binary 
systems (about 26 million sources), including (ii) 25 "verification binaries"; (iii) 
an undisclosed number (between 4 and 6) of MBH binaries with SNRs between 
~ 10 and ~ 2000 and different coalescence times (not all within the two years of 
observation); and (iv) 5 EMRIs, with SNRs between 30 and 100. 

Details about the ranges from which the source parameters were drawn randomly are 
given in table Q] Figure Q] shows a representative Challenge-2.2 data set (not used 
in the actual challenge). Since data sets 2.1 and 2.2 contain signals from millions 
of Galactic binaries, the Task Force has identified four restricted frequency bands on 
which analysis should concentrate first, and over which the evaluation of responses 
will be carried out in much more depth: these windows are 0.2985 mHz < / < 0.3015 
mHz, 0.9985 mHz < / < 1.0015 mHz, 2.9985 mHz < / < 3.0015 mHz, and 5.9985 
mHz < / < 6.0015 mHz. 

As in Challenge 1, we release both blind challenge data sets (with undisclosed 
source parameters) and training data sets (with public parameters selected randomly 
within the same ranges). The training data sets come in two flavors: "noisy" and 
noise-free, the latter containing exactly the same GW signal(s) as those present in the 
noisy set. In addition, to facilitate the development and testing of analysis schemes, 
training data sets 2.1 and 2.2 will contain the same realization of the Galaxy. This 
will not be the true for the blind sets. 

For each challenge data set the three TDI channels X, Y and Z are distributed 
through the MLDC website [8]. The MLDC data sets are encoded in a format 
implemented using XML (the extensible Markup Language), a simple, flexible and 
widely used text format related to HTML [S]. Extensive software libraries to handle 
XML are readily available. The XML implementation of the MLDC file format (known 
as lisaXML) is based on XSIL (the extensible Scientific Interchange Language) |10j . 
A number of dedicated software tools to read and write lisaXML files (for C/C++, 
Python, MATLAB, and for conversion to ASCII) were developed by the MLDC Task 
Force and are available in the LISAtools Subversion archive [IT]. The archive also 

§ These data sets are still identified as 1.3.X, since training data sets for EMRIs were distributed 
with Challenge 1; blind data sets are however being released only now. For consistency with previous 
documents, we maintain the original nomenclature. 

|| We compute SNRs for single unequal-Michelson TDI variables, selecting for each source the variable 
(X, Y, or Z) that yields the highest SNR. 
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Table 1. Summary of data sets and source-parameter ranges in Challenge 2. 
All angular parameters are drawn randomly from uniform distributions over the 
whole relevant range. Source distances are set to provide the SNR at which the 
signals were designed to appear. In this table, the time of coalescence t c is given 
relative to the time at the beginning of the observation (i.e., the time stamp of the 
first data sample); U[-,-\ stands for uniform distribution within the given range. 
Notice that EMRIs drawn from the same parameter ranges in data sets 1.3.X and 
2.2 differ in SNR. 



Dataset Sources 



Parameters 



1.3 



1.3.1 
1.3.2 
1.3.3 
1.3.4 
1.3.5 



EMRIs 



one source with 
one source with 
. one source with 
, one source with 
. one source with 



fi/M G £ [7(9.5, 10.5], S/M 2 £ [7(0.5,0.7] 
time at plunge £ [7[2 21 ,2 22 ] x 15 s 
eccentricity at plunge £ [/[0.15, 0.25] 
M/10 7 M Q £ [7(0.95,1.05], SNR £ [7(40, 110] 
A7/10 6 Af Q £ [7(4.75,5.25], SNR £ [7(70,110] 
A7/10 6 A7 Q £ [7(4.75,5.25], SNR £ [7(40,60] 
A7/10 6 A7 Q £ [7(0.95,1.05], SNR £ [7(70,110] 
M/10 6 A7 Q £ [7(0.95,1.05], SNR £ [7(40,60] 



2.1 



Galactic binaries 

~3x 10 7 sources 

Verification binaries 

25 sources 



drawn from population (see Sec 



parameters in XML file posted on [5] 



2.2 



Galactic binaries 

~ 3 x 10 7 sources 

Verification binaries 

25 sources 

MBH binaries 

. . . source n. 1 
. . . source n. 2 

. . . and 2-4 out of these 4: 
. . . source n. 3 
. . . source n. 4 
. . . source n. 5 
. . . source n. 6 

EMRIs 



. . . one source with 
. . . two sources with 
. . . two sources with 



drawn from population (see Sec 14. 3|) 

parameters in XML file posted on [8] 

m 1 /10 6 M Q £ 17(1,5], m 2 /m 1 £ [7(1,4] 
t c £ [7(60, 120] days, SNR - 2000 
t c £ [7(750, 780] days, SNR - 20 



t c £ C/[180,720] days, SNR - 1000 
t e £ 17(180, 720] days, SNR - 200 
t c £ C/[495,585] days, SNR - 100 
t c £ [7(810,840] days, SNR - 10 

M/M Q £ 17(9.5, 10.5], S/M 2 £ [7(0.5,0.7] 
time at plunge £ [7[2 21 ,2 22 ] x 15 s 
eccentricity at plunge £ [/[0.15, 0.25] 
Af/10 7 Af Q £ [7(0.95, 1.05], SNR £ [/[30, 100] 
A7/10 6 M Q £ [7(4.75,5.25], SNR £ [7(30,100] 
M/10 6 A7 Q £ [7(0.95, 1.05], SNR £ [7(30, 100] 
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Figure 1. A representative Challenge-2.2 data set, plotted as the fractional 
frequency fluctuation spectrum of the full TDI X signal, and separately of the 
contributions from instrument noise and from the individual sources (bundling all 
Galactic binary sources as the "Galaxy"). 

includes all the software used for the data production pipelines (see the Task Force 
wiki [4] for its usage), including the waveform-generation codes described in the next 
sections, as well as a variety of other software tools useful to analyze MLDC data sets. 

The actual generation of the data sets is the responsibility of one member of the 
Task Force who does not take part in the challenges. The deadline for submission of 
results (again through the MLDC website) is June 15th, 2007. The Task Force plans 
to process the results and provide an initial summary and evaluation of this round 
within about a month of receiving the results. 

3. Modeling of LISA: Pseudo-LISA 

We have developed a set of conventions to describe the LISA orbit and response, which 
constitute the pseudo-LISA adopted in Challenges 1 and 2. The pseudo-LISA orbits 
are obtained by truncating exact Keplerian orbits for a point mass orbiting the Sun 
to first order in the eccentricity (see the Appendix of 12J). The two simulators used 
for the generation of the data sets - the LISA Simulator [TJ] and Synthetic LISA [T3] 
- comply with these assumptions, and adhere to these conventions. In Solar-System 
Barycentric (SSB) coordinates (with the x axis aligned with the vernal point), we set 

x n — a cos a + a e (sin a cos a sin f3 n — (1 + sin 2 a) cos /3 n ) , 

y n = a sin a + a e (sin a cos a cos (3 n — (1 + cos 2 a) sin (3 n ) , (1) 

z n = — V3 a e cos(a — n ) , 
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Table 2. Common source parameters. Note that in the initial challenges we do 
not deal explicitly with the redshifting of sources at cosmological distances; thus, 
D is a luminosity distance, and the masses and frequencies of table [4] are those 
measured at the SSB, which are red/blue-shifted by factors (l + z)^ 1 with respect 
to those measured locally near the sources. 



Parameter 


Symbol 


Standard parameter name 


Standard unit 






(lisaXML descr.) 


(lisaXML descr.) 


Ecliptic latitude 





EclipticLatitude 


Radian 


Ecliptic longitude 


X 


EclipticLongitude 


Radian 


Polarization angle 


$ 


Polarization 


Radian 


Inclination 


i 


Inclination 


Radian 


Luminosity distance 


D 


Distance 


Parsec 



where (3 n = (n — 1) X 2ir/3 + X (with n = 1,2,3) is the relative orbital phase 
of the n-th spacecraft, a — 1 AU is the semi-major axis of the guiding center, 
a(t) = 27rf/(lyear) + K is its orbital phase, and t is time measured at the SSB. 
In this approximation, the spacecraft form a rigid equilateral triangle with side length 
L = 2\/3ae = 5 x 10 6 km for e = 0.00965. (In fact, the LISA Simulator and 
Synthetic LISA implement e 2 -accurate orbits, but the additional terms make very 
little difference to the instrument response.) The LISA Simulator parameters k and 
A (InitialPosition and InitialRotation in lisaXML) set the initial location and 
orientation of the LISA constellation; in Challenges 1 and 2, k = X = 0. This choice 
places LISA at the vernal point at time t = 0, with spacecraft 1 directly below the 
guiding center in the southern ecliptic hemisphere Sj] 

The one-way measurements between adjacent spacecraft that are necessary to 
build the LISA response to GWs are taken to be either the phase response (as 
employed in the LISA Simulator, see Sec. II of [12 ) or the fractional frequency response 
yf£ (as used in Synthetic LISA, see Sec. II B of [33]). The TDI Rosetta Stone [H] 
provides details for translations between index notations. The phase and fractional- 
frequency formalisms are equivalent, and are related by a simple time integration^ 

LISA employs Time-Delay Interferometry (TDI) to suppress the otherwise 
overwhelming laser phase noise (see [El [17\ [18] and references therein). TDI 
observables are constructed from time-delayed linear combinations of one-way 
measurements, and they represent synthesized interferometers where laser phase 
fluctuations move in closed paths across the LISA arms. More complicated paths 
are required to deal with the real-orbit variations of the armlengths, giving rise to the 
three TDI "generations." For the initial challenges, we adopt TDI 1.5 observables 
[17] HE]- In particular, the data sets are the unequal-arm Michelson observables X, 

% The mapping to the Synthetic LISA |13| parameters is rjo = re, £o = 3n/2 — re + A, sw < 0; the 
mapping to Cutler's 1996 model [14] is </>o = re, ao = 37r/4 + re — A. 

+ However, the LISA Simulator and Synthetic LISA adopt different sign conventions: is given by 
the local-laser phase minus the incoming-laser phase, while y s i r by the incoming-laser frequency minus 
the local-laser frequency. The final GW responses do end up being consistent, since the two simulators 
use different signs in their definition of the GW polarization tensors [see l4t ]. The LISA Simulator 
produces equivalent- strain data, with a nominal length of L n = 10 10 m; to convert equivalent strain 
to fractional frequency one needs to differentiate and multiply by L„/c. The additional factor of 2ir 
given in [6] for this conversion is incorrect. 
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Y, and Z denned in [TS]. Strictly speaking, TDI 2.0 would be required to completely 
cancel laser noise in a flexing LISA array, such as that modelled by the simulators; 
however, the increase in complexity between TDI 1.5 and 2.0 complicates the numerical 
treatment of one-way measurements, but it is negligible for the purpose of the GW 
response. For the sake of simplicity, Challenges 1 and 2 feature data sets of TDI 1.5 
observables with no laser noise. 

The model of the LISA instrument noise adopted in Challenge 2 is identical to 
that considered for Challenge 1. It includes contributions from optical noise (assumed 
white in phase), with one-sided spectral density 

5 o 1 / t 2 (/) = 20xl0- 12 mHz- 1 / 2 , (2) 

and from acceleration noise (assumed white in acceleration, but increasing as 1// 
below 10~ 4 Hz), with one-sided spectral density 

=3 x 10- 15 [l + (10- 4 Hz//) 2 ] 1/2 ms- 2 Hz- 1/2 . (3) 

As mentioned above, we do not model laser phase noise. The six optical noises and 
six acceleration noises (for the two optical benches on each spacecraft) are treated as 
independent Gaussian random processes with variances given by |2]) and ([3]), and are 
realized in practice with sequences of pseudo-random numbers. Specifically, Synthetic 
LISA generates independent Gaussian deviates (i.e., white noise) in the time domain, 
and then filters them digitally to obtain the desired spectral shape; the LISA Simulator 
generates independent Gaussian deviates in the frequency domain, multiplies them by 
S 1/2 (f), and FFTs to the time domain. 

4. Gravitational waveforms 

In this section we describe the conventions adopted to describe the gravitational 
waveforms and the assumptions made on the signals to construct the data sets. They 
are identical to those adopted in Challenge 1, but for this challenge we introduce two 
new ingredients: the model of the Galaxy used to simulate a population of stellar-mass 
binaries, and the kludge EMRI waveforms. 

4-1- Gravitational-wave polarizations 

The sky location of a GW source is described in J2000 ecliptic coordinates: to 
wit, the latitude (3 and longitude A, the latter measured from the vernal point, 
aligned with the x axis in our convention. Gravitational radiation travels along the 
direction k = — (cos (3 cos A, cos (3 sin A, sin /3), with surfaces of constant phase given by 
£ = t — k ■ x. In the transverse-traceless gauge, the gravitational strain tensor can be 
decomposed in two polarization states and h x ({;), and is given bjQ 

h (£) = [u ® u - v <g> v] + h x (0 [u ® v + v <g> u] , (4) 

where u = dk/d/3, v oc dk/dX. Thus, GWs from any MLDC source are completely 
specified by /3, A, and by the two functions h + (£) and h y (^) for the source GW 
polarization amplitudes, measured at the SSB. 

* In the version of Synthetic LISA available at the time of releasing Challenge 2, the polarization 
tensors have opposite signs with respect to and to the LISA Simulator convention. This minus 
sign cancels out the sign difference in the definition of the basic phase and frequency measurements. 
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Table 3. GalacticBinary source parameters. Note that Amplitude effectively 
replaces the standard Distance parameter. 



Parameter 


Symbol 


Standard parameter name 
(lisaXML descr.) 


Standard unit 
(lisaXML descr.) 


Amplitude 


A 


Amplitude 


1 (GW strain) 


Frequency 


f 


Frequency 


Hertz 


Initial GW phase 


<t>0 


InitialPhase 


Radian 



The orbital orientation of nonprecessing binaries is described by the inclination 
i (the angle between the line of sight k and the orbital angular momentum of the 
binary), and by their polarization angle ip: specifically, if h+(£) and /ix(0 are the 
binary's GW polarizations in its source frame (i.e., the polarizations defined with 
respect to the binary's principal polarization axes p and q) then 

MO + ih* (0 = e" 2 ^ [hi (0 + ih s x (£)] , (5) 
with ip = — arctan(i> ■ p/u ■ p). Together with (3, A, and with the luminosity distance 
D, l and ip form a set of common standard parameters, listed in table [2] with their 
standard lisaXML descriptors (see [6] for a description of the XML files adopted for 
the MLDCs). 

4-2. Galactic stellar mass binaries 

In Challenges 1 and 2, a Galactic stellar mass binary system with component masses 
mi and mi at a distance D is modelled as a system of two point masses in circular 
orbit with constant period. The source-frame polarization amplitudes are given by 

h%{£) =A(l + cos 2 l) cos(27r/^ + 0o), (6) 

h 8 x (f) = - 2.4(cos t) sin(2^/e + <t>o), 
where the amplitude is derived from the physical parameters of the source as A = 
(2n/D)(TrMf) 2 / 3 , with M = mi + »Ti2 the total mass, and /i = m^m-ifM the reduced 
mass; (f> is an arbitrary random initial phase. Notice that / is constant in the SSB, 
but not in the final LISA data set, because of the Doppler shifts induced by the LISA 
orbital motion. A Challenge-2 GalacticBinary source is completely determined by 
the parameters listed in tables [2] and [3] 

Data sets 2.1 and 2.2 contain 25 "verification binaries," defined as systems whose 
location and orbital period are exactly known (conceptually from electromagnetic 
observations); the remaining four parameters are selected randomly. We consider five 
real known systems and then select 20 more from a synthetic model of the Galaxy (see 
below). In addition to verification binaries, about 26 million white-dwarf binaries are 
included from the population-synthesis model. The same set of stellar mass binaries 
have been used in the training sets for Challenges 2.1 and 2.2 to allow participants to 
compare the source recovery with and without signals of other types being present. 
Different realizations of the stellar-mass binaries are used in each of the blind data 
sets. 

4-3. Galactic model 

The Challenge-2 model of the Galaxy is derived from a modern population-synthesis 
code [19] , and contains some 26 million white-dwarf binaries. Rather than re-running 
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the simulation each time a new realization was needed, we used a single simulation 
ouput, and created new versions by tweaking the frequencies of the binaries by a 
random amount (of order 3 /xHz), and by randomly drawing new values for the 
inclination, polarization and initial phase. These perturbations are large enough to 
make the simulated galaxies distinct from the perspective of data analysis, yet small 
enough to leave the population unaffected in an astrophysical sense. 

Since the LISA Simulator and Synthetic LISA take a few minutes to process 
a single source, it is not practical to use them to generate the response to a full 
galactic background. Instead we used a new simulation tool [5U] that was designed 
especially to model slowly evolving signals. The output of this new tool has been 
checked against the LISA Simulator and Synthetic LISA, and the results agree very 
accurately. However, the dedicated code is able to process the entire Galaxy in just a 
few hours on a single processor. 



4-4- Massive-black-hole binaries 

In Challenges 1 and 2, we restrict massive-black hole binaries to having circular orbits 
and nonspinning black holes, and we consider only the inspiral phase of the whole 
coalescence process. We model the inspiral at the restricted second post-Newtonian 
(2PN) order, following [211 [22]. I n terms of the rescaled time 

where t c is the time at coalescence, and -q = [i/M, the time evolution of the orbital 
angular frequency u> and phase $ is given at 2PN order by 

Mu = 




14450688 



(8) 



and 



$ = —(Muj 

32?7 V 




1016064 



(Muj) 2/3 - IOtt(Mw) 



W 4/3 ■ (9) 



The phase of the polarization functions is then computed according to $(t) — $(t = 
0) + $0, where $o 1S an arbitrary initial orbital phase: 

h S + (0 = ^[M^(e)] 2 / 3 (l + cos 2 Ocos[2$(0], (10) 

^(0 = - ^[M^(e)] 2/3 (2cos0sin[20>(e)]. (11) 

In order to avoid numerical artifacts at the end of the inspiral (taken to be the last 
stable orbit), we taper the signal according to 

w{t) = 1 (1 - tanh [A(M/R - M/R^)]) , (12) 

where R is approximated with Kepler's law (R = M 1 / 3 ^" 2 / 3 ), and where the 
dimensionless coefficient A — 150 was determined empirically to produce smooth 
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Table 4. BlackHoleBinary source parameters. Note that the tapering radius was 



fixed at R = 7M for Challenge 2. 


Parameter 


Symbol 


Standard parameter name 


Standard unit 






(lisaXML descr.) 


(lisaXML descr.) 


Mass of first BH 


mi 


Massl 


SolarMass 


Mass of second BH 


7T12 


Mass2 


SolarMass 


Time of coalescence 


t c 


CoalescenceTime 


Second 


Angular orb. phase 


$(, 


InitialAngularOrbitalPhase 


Radian 


at time t — 








Tapering radius 


R 


TaperApplied 


TotalMass 



damping; i?t a por is set to 7M. The lisaXML standard parameters for Challenge-2 
BlackHoleBinary sources are listed in tables [2] and 2J 

4-5. Extreme-Mass-Ratio Inspirals 

Extreme-Mass-Ratio Inspirals - compact objects (CO) orbiting a MBH - constitute 
the new source class included in the second round of MLDCs. For these, we adopt the 
Barack-Cutler "analytic kludge" waveforms [23], where orbits are instantaneously 
approximated as Newtonian ellipses and gravitational radiation is given by the 
corresponding Peters-Matthews formula [24] . but perihelion direction, orbital plane, 
semi-major axis and eccentricity evolve according to post-Newtonian equations. While 
these waveforms are not particularly accurate in the highly relativistic regime of 
interest for real EMRI searches, they do exhibit the main qualitative features of the 
true waveforms, and they are considerably simpler to generate. They are therefore 
ideal to develop and test search strategies. It is expected that any strategy that 
works for them could be modified fairly easily to deal with the true general-relativistic 
waveforms, once these become available. 

In general relativity, the full two-body system is described by 17 parameters. Here 
we assume that the spin of the CO is negligible with respect to the MBH spin and can 
therefore be ignored: the signal is then described by 14 parameters, which we now list. 
Let us consider a CO of mass /i (<C M) orbiting a MBH of mass M on an orbit with 
semi-major axis a, eccentricity e and orbital frequency v. At the Newtonian order, 
v = (27rM)~ 1 (M/a) 3 / 2 , and the orbital mean anomaly <£> (i.e., the average orbital 
phase with respect to the direction of pericenter) is $(i) = 2nv(t — to) + $o, where $o 
is the mean anomaly at to. The spin S of the MBH is parametrized by its magnitude 
S (so that < S/M 2 < 1) and by the two polar angles 6 K , <t>K in the SSB (here "K" 
stands for Kerr). L(t) represents the time- varying orbital angular momentum: its 
direction is parametrized by the constant angle A (not to be confused with the ecliptic 
longitude, defined in Sec l4.ip between L and §^ and by an azimuthal angle a(t) that 
describes the direction of L around S. The angle ^(t) is the (intrinsic) direction of 
pericenter, measured with respect to L x S. Fixing the initial time to (as measured 
in the SSB), v , eo, 70, and $0 describe, respectively, the eccentricity, the direction of 
the pericenter within the orbital plane, and the mean anomaly at to- More specifically, 

jt In reality, radiation reaction will impose a small time variation in A; however, this variation is 
known to be very small 25 and is neglected here. 
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7o is the angle in the plane of the orbit from L x S to pericenter, and <&o is the mean 
anomaly with respect to pericenter passage. We refer the reader to figure 1 of [S3] for 
a graphic representation of the system. 

It is useful to describe the GW tensor in a time- varying frame defined with respect 
to the unit vectors n = —k (pointing from the origin of the SSB to the source) and 
L(t). Defining the unit vectors p and q by 

p= (h x L)/\n x L\, 

q = p x h, (13) 

the general GW strain field at the SSB can then be written as 

HO = K(0 [P ® P - q ® q] + hx(0 [P ® q + q ® p] ■ (14) 

Notice that the polarization tensors [p <£> p — q ® q] and [p ® q + g ® p] are functions 
of time. One can decompose (|14j) into harmonic contributions at integer multiples of 
the orbital frequency 

h+ = j2 A i^ h x =j2K, (15) 

n n 

where the A+ ,x are 

A+= - [I + (L ■ n) 2 ] [an cos(2 7 ) - b n sin(2 7 )] + [1 - (L ■ n) 2 ]c ni 

Al = 2(1 • n) [b n cos(2 7 ) + a n sin(2 7 )] . (16) 
The harmonics A+' x are expressed as a function of the coefficients 

a„ = - nA[J n - 2 {ne) - 2eJ„_i(ne) 

+ (2/n)J n (ne) + 2eJ n+1 (ne) - J n+2 (ne)] cos[n$(<)], (17) 

b n = - nA(l - e 2 ) 1/2 [J n -2{ne) - 2J n (ne) + J n +2{ne)] sin[n$(t)], 

c„ = 2^.J n (ne)cos[n$(t)]. 

Here the J n are the Bessel functions of the first kind, and 7 is an azimuthal angle 
measuring the direction of pericenter with respect to x = [— h+L(L-h)]/[l — (L-n) 2 ] 1 / 2 . 
The angle 7 is related to 7 by 

7 = 7 + A (18) 
where (3 is the angle from x oc [L(L • h) — h] to (L X S), as given by 

. „ (cos \)L ■ n — S ■ h „ h-(SxL) 
sin/3 = ; - , cos/3 = v ; (19) 



(sinA)^/!-^-?!) 2 (sinA)yi - (i • n) 2 

The overall amplitude is given by 

^= (2^Af) 2 / 3 -^. (20) 

In practice, we truncate the sums in (|15p at n = 4 when e(t) < 0.136, otherwise at 
n = 30e(i). 

The angular- momentum direction vector L is not constant, since L precesses 
about the MBH spin direction 5. Let 9l{£), ^i(t) be the angles specifying the 
instantaneous direction of L, and let z be a unit vector normal to the ecliptic. One 
then obtains 

t ?i z — ScosOk . Sxz , 

L = d cos A H smAcosa-l — sin A sin a, (21) 

sin 9 k sin 9 k 
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and the angles 9^{t), 0l(£) are given in terms of 9k, 4>Ki A. an( i a {t) by 
cos#z,(i) = cos 6>k cos A + sin 9 k sin A cos a (t), 

sin £, (i) cos </>l (f ) = sin #k cos 0k cos A + sin 4>k sin A sin a(t) 

— cos 0k cos 9k sin Acosa(i), (22) 
sin #£,(£) sin(/>i(t) = sin#K sin 0k cos A — cos 4>k sin Asina(i) 

— sin k cos 0^ sin A cos a(i). 
To work with these expressions one can exploit the relations 

S ■ n = cos 9s cos 9k + sin 9s sin Ok cos(0s — 0k), (23) 



- / A fx . n ■ r \ ■ S ■ n cos 9 k — cos 9s . . . ,_, 
n • (6 x L) = sin #5 sm(0K — 0s) sin A cos a H : — sm A sin a, (24) 



sm f k 



and 



(25) 



t „ a „ , cos 9s — S ■ n cos 6>k . , Sxz Vti . , , 

L ■ n = b> ■ n cos A H sm A cos a H sm A sm a, 

sm 9 k sm pk 

= cos#s cos^l + sin^s sin#L cos(0s — 0i). 

Note that the time-variation of S ■ n is very small in the extreme mass-ratio case 
considered here, and in our kludged approximation we approximate 5* - and hence 
S • h — as strictly constant. 

The evolution of <&(£), v{t), ~f(t), e(t), and a(t) is given by the PN formulas 



dt 



(26) 



1 73 2 37 4 
1 + 24 C + 96 C 



:i-e 2 ) 



(2ttA/^) 2/3 



1273 2561 2 3885 4 13147 fi 
■e - — — -e 4 - — — — e b 



336 224 



128 



5376 



S 



(2 7 rA^)^(cosA)(l-e 2 )- 1 / 2 



73 1211 , 3143 . 65 fi 
e 2 + -rr-e 4 + — e b 



12 24 



64 



(27) 



de 



= 6iris(2nvM) 2/3 (l - e 2 )" 1 1 + (2wvM) 2/3 (l - e 2 )^ 1 (26- 15e 2 )/4 

- 127r^(cosA)^|r(27rMi/)(l - e 2 )~ 3/2 , 
M 2 



(28) 



dt 15M 2V ' 1 ; 



(304+ 121e 2 )(l - e 2 )(l + 12(2ttM^ 



\2/3 



— (2-kMv) 2/3 ( 133640 + 108984 e 2 - 25211 e 4 
56 V 



+ e ^A^ (C ° SA)(2 " M " )11/3(1 " e2r4 
4™A( 2 .M,)(1 -e 2 )- 3 / 2 . 



1364 5032 2 263 , 
e 2 + — r-e 4 



15 



10 



(29) 



(30) 



For a point particle in Schwarzschild, the plunge occurs at a m i n = M (6 + 2e) (1 — e 2 ) 1 
[26] , so we set 

iW = (2 7 rM)- 1 [(l - e 2 )/(6 + 2e)] 3 / 2 , (31) 
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and we shut-off the waveform when v reaches this y max . 

Equation (| 14[) expresses the waveform in terms of time-varying polarization 
tensors, but to generate the LISA responses it is necessary to re-express it in terms 
of fixed polarization tensors. This is achieved through a rotation by the polarization 
angle 

/ cos 6> s sin 6> L cos(0 s - <p L ) - cos6» L sin 0s \ 

if; = arctan , . — . (32) 

V sin 0l sin(0s - (pL) J 

The algorithm for constructing the MLDC EMRI waveforms works as follows. 
We fix a fiducial frequency v§ at to = and choose the waveform parameters 
fi, M, S/M 2 , e , 70, $o, cos 9s, <j>s, cos A, a , cos 8k, §k and D. We then (i) solve 
the ODEs j26])-(lSni for v(t), j(t), e(t), a(t); (ii) use e(t) and z/(t) to calculate 

an{t),b n (t),c n {t) in (fT7|: (iii) calculate 0z,(i), 4>L(t) using ([22jl . and then obtain j(t) 
from j(t) using p^ |) -([T9 |) : (iv) compute the amplitude coefficients A+ |X using (fl6|) 
and ((25|) ; (v) calculate ?/> using (|32|) : (vi) finally, compute h + ^ y (t) fron jffl 

h+(t) = A + (t) cos2tp(t) + A x (t) am2t()(t), 

h x (t) = -A+(t)sm2ijj(t) + A x (t)cos2ijj{t). (33) 

The lisaXML standard parameters for Challcngc-2 ExtremeMassRatioInspiral 
sources are listed in tables [2] and 



5. Conclusions 



The Mock Data Challenges (MLDCs) are aimed at stimulating the development and 
demonstrating the technical readiness of LISA data-analysis capabilities. The first 
round of MLDCs has just been completed and the second round data sets have been 
released. The latter provide a fairly realistic testbed for the data-analysis challenges 
that are novel to LISA (as opposed to ground-based detectors): in particular, the 
problem of detecting EMRIs and the global aspect of analyzing a multitude of 
simultaneous, overlapping signals. 
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